Overview

Dataset info

Number of variables: 32
Number of observations: 1470
Missing cells: 2643 (5.6%)
Duplicate rows: 0 (0.0%)
Total size in memory: 1.5 MiB
Average record size in memory: 1.1 KiB
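
The missing-cell percentage can be reproduced from the figures above if a "cell" is taken to be one variable value for one observation. A minimal sketch of that arithmetic, under that assumption (the variable names are illustrative and not part of the report):

```python
# Figures copied from the "Dataset info" list above.
n_variables = 32
n_observations = 1470
missing_cells = 2643

# Assumption: a cell is one value of one variable for one observation.
total_cells = n_variables * n_observations
missing_share = missing_cells / total_cells

print(f"{total_cells} cells, {missing_share:.1%} missing")
# prints "47040 cells, 5.6% missing"
```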

Variables types

CAT: 17
NUM: 12
BOOL: 3

Reproduction info

Date of analysis: 2023-03-03 16:48:03.396449
Version: pandas-profiling v2.4.0
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configuration: config.yaml

Warnings

anos_compania has 44 (3.0%) zeros [Zeros]
anos_con_manager_actual has 263 (17.9%) zeros [Zeros]
anos_desde_ult_promocion has 581 (39.5%) zeros [Zeros]
anos_en_puesto has 1238 (84.2%) missing values [Missing]
conciliacion has 1011 (68.8%) missing values [Missing]
educacion has 101 (6.9%) missing values [Missing]
empleados has constant value "1" [Rejected]
horas_quincena has constant value "80" [Rejected]
implicacion has 18 (1.2%) missing values [Missing]
mayor_edad has constant value "Y" [Rejected]
num_empresas_anteriores has 197 (13.4%) zeros [Zeros]
num_formaciones_ult_ano has 54 (3.7%) zeros [Zeros]
satisfaccion_trabajo has 76 (5.2%) missing values [Missing]
sexo has 199 (13.5%) missing values [Missing]
salario_mes is highly correlated with nivel_laboral [High Correlation]
nivel_laboral is highly correlated with salario_mes [High Correlation]
sexo is highly correlated with anos_en_puesto [High Correlation]
anos_en_puesto is highly correlated with sexo [High Correlation]
conciliacion is highly correlated with anos_en_puesto and 4 other fields [High Correlation]
anos_en_puesto is highly correlated with conciliacion and 4 other fields [High Correlation]
educacion is highly correlated with anos_en_puesto and 4 other fields [High Correlation]
implicacion is highly correlated with anos_en_puesto and 4 other fields [High Correlation]
puesto is highly correlated with departamento [High Correlation]
departamento is highly correlated with puesto [High Correlation]
satisfaccion_trabajo is highly correlated with anos_en_puesto and 4 other fields [High Correlation]
sexo is highly correlated with anos_en_puesto and 4 other fields [High Correlation]
mayor_edad is highly correlated with horas_quincena [High Correlation]
horas_quincena is highly correlated with mayor_edad [High Correlation]
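
The per-variable missing counts in the warnings add up to the missing-cell total reported in the overview. The sketch below tallies them and ranks the affected columns. The 50% cut-off used to single out mostly empty columns is an arbitrary, illustrative threshold and is not something the report defines.

```python
# Missing-value counts copied from the warnings above.
missing_by_column = {
    "anos_en_puesto": 1238,
    "conciliacion": 1011,
    "sexo": 199,
    "educacion": 101,
    "satisfaccion_trabajo": 76,
    "implicacion": 18,
}
n_observations = 1470

# The sum should match the "Missing cells" figure in the overview (2643).
total_missing = sum(missing_by_column.values())
print(total_missing)

# Hypothetical threshold: flag columns with more than half of their values missing.
MOSTLY_MISSING = 0.5
mostly_missing = [
    name
    for name, count in missing_by_column.items()
    if count / n_observations > MOSTLY_MISSING
]

for name, count in missing_by_column.items():
    print(f"{name}: {count / n_observations:.1%}")
print("mostly missing:", mostly_missing)
```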

Variables

abandono
Boolean

Distinct count2
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
ValueCountFrequency (%) 
No 1233 83.9%
 
Yes 237 16.1%
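
Each frequency in the table above is the value's count divided by the number of observations. A minimal sketch of that calculation for abandono, using only the counts shown in the table:

```python
# Counts copied from the abandono frequency table.
counts = {"No": 1233, "Yes": 237}

total = sum(counts.values())
shares = {label: count / total for label, count in counts.items()}

for label, share in shares.items():
    print(f"{label}: {share:.1%}")
# prints "No: 83.9%" then "Yes: 16.1%"
```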
 

anos_compania
Real number (ℝ≥0)

ZEROS
Distinct count37
Unique (%)2.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.008163265
Minimum0
Maximum40
Zeros44
Zeros (%)3.0%
Memory size11.6 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile1
Q13
median5
Q39
95-th percentile20
Maximum40
Range40
Interquartile range (IQR)6
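
The range and IQR rows are plain differences of the quantiles listed above. A minimal sketch follows. The Tukey fences at the end are an extra illustration of how the IQR is commonly used to flag unusually large values. The report itself does not compute them.

```python
# Quantiles copied from the anos_compania table above.
minimum, q1, median, q3, maximum = 0, 3, 5, 9, 40

value_range = maximum - minimum  # "Range" row
iqr = q3 - q1                    # "Interquartile range (IQR)" row

# Not part of the report: Tukey's 1.5 * IQR rule for candidate outliers.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

print(value_range, iqr, lower_fence, upper_fence)
```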

Descriptive statistics

Standard deviation6.126525152
Coefficient of variation (CV)0.8741984056
Kurtosis3.935508756
Mean7.008163265
Median Absolute Deviation (MAD)4.471686797
Skewness1.764529454
Sum10302
Variance37.53431044
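
Several of the descriptive statistics above are tied together by simple identities: the mean is the sum divided by the number of observations, the coefficient of variation is the standard deviation divided by the mean, and the variance is the squared standard deviation. A small consistency check, using the reported figures for anos_compania:

```python
import math

# Figures copied from the report for anos_compania.
n_observations = 1470
total = 10302
std = 6.126525152

mean = total / n_observations  # reported: 7.008163265
cv = std / mean                # reported: 0.8741984056
variance = std ** 2            # reported: 37.53431044

# Compare against the reported values, allowing for rounding in the report.
print(math.isclose(mean, 7.008163265, rel_tol=1e-6))
print(math.isclose(cv, 0.8741984056, rel_tol=1e-6))
print(math.isclose(variance, 37.53431044, rel_tol=1e-6))
```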
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[ 0. 0.5 1.5 4.5 5.5 ... 10.5 11.5 22.5 33.5 40. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
5 196 13.3%
 
1 171 11.6%
 
3 128 8.7%
 
2 127 8.6%
 
10 120 8.2%
 
4 110 7.5%
 
7 90 6.1%
 
9 82 5.6%
 
8 80 5.4%
 
6 76 5.2%
 
Other values (27) 290 19.7%
 
ValueCountFrequency (%) 
0 44 3.0%
 
1 171 11.6%
 
2 127 8.6%
 
3 128 8.7%
 
4 110 7.5%
 
ValueCountFrequency (%) 
40 1 0.1%
 
37 1 0.1%
 
36 2 0.1%
 
34 1 0.1%
 
33 5 0.3%
 

anos_con_manager_actual
Real number (ℝ≥0)

ZEROS
Distinct count18
Unique (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.123129252
Minimum0
Maximum17
Zeros263
Zeros (%)17.9%
Memory size11.6 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q12
median3
Q37
95-th percentile10
Maximum17
Range17
Interquartile range (IQR)5

Descriptive statistics

Standard deviation3.568136121
Coefficient of variation (CV)0.8653951654
Kurtosis0.1710580839
Mean4.123129252
Median Absolute Deviation (MAD)3.025371836
Skewness0.833450992
Sum6061
Variance12.73159537
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 4.5 ... 7.5 8.5 9.5 13.5 17. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2 344 23.4%
 
0 263 17.9%
 
7 216 14.7%
 
3 142 9.7%
 
8 107 7.3%
 
4 98 6.7%
 
1 76 5.2%
 
9 64 4.4%
 
5 31 2.1%
 
6 29 2.0%
 
Other values (8) 100 6.8%
 
ValueCountFrequency (%) 
0 263 17.9%
 
1 76 5.2%
 
2 344 23.4%
 
3 142 9.7%
 
4 98 6.7%
 
ValueCountFrequency (%) 
17 7 0.5%
 
16 2 0.1%
 
15 5 0.3%
 
14 5 0.3%
 
13 14 1.0%
 

anos_desde_ult_promocion
Real number (ℝ≥0)

ZEROS
Distinct count16
Unique (%)1.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.187755102
Minimum0
Maximum15
Zeros581
Zeros (%)39.5%
Memory size11.6 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q33
95-th percentile9
Maximum15
Range15
Interquartile range (IQR)3

Descriptive statistics

Standard deviation3.222430279
Coefficient of variation (CV)1.472939213
Kurtosis3.612673115
Mean2.187755102
Median Absolute Deviation (MAD)2.34689435
Skewness1.984289983
Sum3216
Variance10.3840569
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 6.5 7.5 15. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 581 39.5%
 
1 357 24.3%
 
2 159 10.8%
 
7 76 5.2%
 
4 61 4.1%
 
3 52 3.5%
 
5 45 3.1%
 
6 32 2.2%
 
11 24 1.6%
 
8 18 1.2%
 
Other values (6) 65 4.4%
 
ValueCountFrequency (%) 
0 581 39.5%
 
1 357 24.3%
 
2 159 10.8%
 
3 52 3.5%
 
4 61 4.1%
 
ValueCountFrequency (%) 
15 13 0.9%
 
14 9 0.6%
 
13 10 0.7%
 
12 10 0.7%
 
11 24 1.6%
 

anos_en_puesto
Categorical

MISSING
HIGH CORRELATION
HIGH CORRELATION
Distinct count5
Unique (%)0.3%
Missing1238
Missing (%)84.2%
Memory size11.6 KiB
ValueCountFrequency (%) 
3 141 9.6%
 
2 54 3.7%
 
4 25 1.7%
 
1 12 0.8%
 
(Missing) 1238 84.2%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length3
Mean length3
Min length3
Scatter

anos_experiencia
Real number (ℝ≥0)

Distinct count40
Unique (%)2.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.27959184
Minimum0
Maximum40
Zeros11
Zeros (%)0.7%
Memory size11.6 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile1
Q16
median10
Q315
95-th percentile28
Maximum40
Range40
Interquartile range (IQR)9

Descriptive statistics

Standard deviation7.780781676
Coefficient of variation (CV)0.6898105701
Kurtosis0.9182695366
Mean11.27959184
Median Absolute Deviation (MAD)6.034188533
Skewness1.117171853
Sum16581
Variance60.54056348
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[ 0. 0.5 1.5 3.5 4.5 ... 10.5 17.5 24.5 33.5 40. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
10 202 13.7%
 
6 125 8.5%
 
8 103 7.0%
 
9 96 6.5%
 
5 88 6.0%
 
7 81 5.5%
 
1 81 5.5%
 
4 63 4.3%
 
12 48 3.3%
 
3 42 2.9%
 
Other values (30) 541 36.8%
 
ValueCountFrequency (%) 
0 11 0.7%
 
1 81 5.5%
 
2 31 2.1%
 
3 42 2.9%
 
4 63 4.3%
 
ValueCountFrequency (%) 
40 2 0.1%
 
38 1 0.1%
 
37 4 0.3%
 
36 6 0.4%
 
35 3 0.2%
 

carrera
Categorical

Distinct count6
Unique (%)0.4%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
ValueCountFrequency (%) 
Life Sciences 606 41.2%
 
Medical 464 31.6%
 
Marketing 159 10.8%
 
Technical Degree 132 9.0%
 
Other 82 5.6%
 
Human Resources 27 1.8%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceTrue
Contains non-wordsTrue

Length

Max length16
Mean length10.53333333
Min length5
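
The length statistics for carrera can be reproduced from its frequency table: the mean length is the label length weighted by how often each label occurs. A minimal sketch, assuming length is counted in characters including spaces:

```python
# Counts copied from the carrera frequency table.
counts = {
    "Life Sciences": 606,
    "Medical": 464,
    "Marketing": 159,
    "Technical Degree": 132,
    "Other": 82,
    "Human Resources": 27,
}

n_observations = sum(counts.values())

# Assumption: length is the number of characters, spaces included.
mean_length = sum(len(label) * count for label, count in counts.items()) / n_observations
max_length = max(len(label) for label in counts)
min_length = min(len(label) for label in counts)

print(max_length, round(mean_length, 8), min_length)
```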
Scatter

conciliacion
Categorical

MISSING
HIGH CORRELATION
Distinct count5
Unique (%)0.3%
Missing1011
Missing (%)68.8%
Memory size11.6 KiB
ValueCountFrequency (%) 
Alta 257 17.5%
 
Media 114 7.8%
 
Muy_Alta 60 4.1%
 
Baja 28 1.9%
 
(Missing) 1011 68.8%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length8
Mean length3.553061224
Min length3
Scatter

departamento
Categorical

HIGH CORRELATION
Distinct count3
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
ValueCountFrequency (%) 
Research & Development 961 65.4%
 
Sales 446 30.3%
 
Human Resources 63 4.3%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceTrue
Contains non-wordsTrue

Length

Max length22
Mean length16.54217687
Min length5
Scatter

distancia_casa
Real number (ℝ≥0)

Distinct count29
Unique (%)2.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9.192517007
Minimum1
Maximum29
Zeros0
Zeros (%)0.0%
Memory size11.6 KiB
Mini histogram

Quantile statistics

Minimum1
5-th percentile1
Q12
median7
Q314
95-th percentile26
Maximum29
Range28
Interquartile range (IQR)12

Descriptive statistics

Standard deviation8.106864436
Coefficient of variation (CV)0.8818982254
Kurtosis-0.2248334049
Mean9.192517007
Median Absolute Deviation (MAD)6.572742839
Skewness0.9581179957
Sum13513
Variance65.72125098
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[ 1. 1.5 2.5 10.5 28.5 29. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2 211 14.4%
 
1 208 14.1%
 
10 86 5.9%
 
9 85 5.8%
 
3 84 5.7%
 
7 84 5.7%
 
8 80 5.4%
 
5 65 4.4%
 
4 64 4.4%
 
6 59 4.0%
 
Other values (19) 444 30.2%
 
ValueCountFrequency (%) 
1 208 14.1%
 
2 211 14.4%
 
3 84 5.7%
 
4 64 4.4%
 
5 65 4.4%
 
ValueCountFrequency (%) 
29 27 1.8%
 
28 23 1.6%
 
27 12 0.8%
 
26 25 1.7%
 
25 25 1.7%
 

edad
Real number (ℝ≥0)

Distinct count43
Unique (%)2.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean36.92380952
Minimum18
Maximum60
Zeros0
Zeros (%)0.0%
Memory size11.6 KiB
Mini histogram

Quantile statistics

Minimum18
5-th percentile24
Q130
median36
Q343
95-th percentile54
Maximum60
Range42
Interquartile range (IQR)13

Descriptive statistics

Standard deviation9.135373489
Coefficient of variation (CV)0.2474114564
Kurtosis-0.4041451372
Mean36.92380952
Median Absolute Deviation (MAD)7.409795918
Skewness0.4132863019
Sum54278
Variance83.45504879
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[18. 23.5 25.5 28.5 36.5 42.5 46.5 55.5 60. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
35 78 5.3%
 
34 77 5.2%
 
36 69 4.7%
 
31 69 4.7%
 
29 68 4.6%
 
32 61 4.1%
 
30 60 4.1%
 
33 58 3.9%
 
38 58 3.9%
 
40 57 3.9%
 
Other values (33) 815 55.4%
 
ValueCountFrequency (%) 
18 8 0.5%
 
19 9 0.6%
 
20 11 0.7%
 
21 13 0.9%
 
22 16 1.1%
 
ValueCountFrequency (%) 
60 5 0.3%
 
59 10 0.7%
 
58 14 1.0%
 
57 4 0.3%
 
56 14 1.0%
 

educacion
Categorical

MISSING
HIGH CORRELATION
Distinct count5
Unique (%)0.3%
Missing101
Missing (%)6.9%
Memory size11.6 KiB
ValueCountFrequency (%) 
Universitaria 814 55.4%
 
Secundaria 348 23.7%
 
Master 130 8.8%
 
Primaria 77 5.2%
 
(Missing) 101 6.9%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length13
Mean length10.72176871
Min length3
Scatter

empleados
Boolean

CONST
Distinct count1
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
ValueCountFrequency (%) 
1 1470 100.0%
 

estado_civil
Categorical

Distinct count3
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
ValueCountFrequency (%) 
Married 673 45.8%
 
Single 470 32.0%
 
Divorced 327 22.2%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length8
Mean length6.902721088
Min length6
Scatter

evaluacion
Categorical

Distinct count2
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
ValueCountFrequency (%) 
Alta 1244 84.6%
 
Muy_Alta 226 15.4%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length8
Mean length4.614965986
Min length4
Scatter

horas_extra
Boolean

Distinct count2
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
ValueCountFrequency (%) 
No 1054 71.7%
 
Yes 416 28.3%
 

horas_quincena
Categorical

CONST
HIGH CORRELATION
Distinct count1
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
ValueCountFrequency (%) 
80 1470 100.0%
 

Composition

Contains charsFalse
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length2
Mean length2
Min length2
Scatter

id
Real number (ℝ≥0)

UNIQUE
Distinct count1470
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1024.865306
Minimum1
Maximum2068
Zeros0
Zeros (%)0.0%
Memory size11.6 KiB
Mini histogram

Quantile statistics

Minimum1
5-th percentile96.45
Q1491.25
median1020.5
Q31555.75
95-th percentile1967.55
Maximum2068
Range2067
Interquartile range (IQR)1064.5

Descriptive statistics

Standard deviation602.0243348
Coefficient of variation (CV)0.5874180063
Kurtosis-1.223178906
Mean1024.865306
Median Absolute Deviation (MAD)522.4050757
Skewness0.01657401958
Sum1506552
Variance362433.2997
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[1.000e+00 2.068e+03], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1 1 0.1%
 
1391 1 0.1%
 
1389 1 0.1%
 
1387 1 0.1%
 
1383 1 0.1%
 
1382 1 0.1%
 
1380 1 0.1%
 
1379 1 0.1%
 
1377 1 0.1%
 
1375 1 0.1%
 
Other values (1460) 1460 99.3%
 
ValueCountFrequency (%) 
1 1 0.1%
 
2 1 0.1%
 
4 1 0.1%
 
5 1 0.1%
 
7 1 0.1%
 
ValueCountFrequency (%) 
2068 1 0.1%
 
2065 1 0.1%
 
2064 1 0.1%
 
2062 1 0.1%
 
2061 1 0.1%
 

implicacion
Categorical

MISSING
HIGH CORRELATION
Distinct count5
Unique (%)0.3%
Missing18
Missing (%)1.2%
Memory size11.6 KiB
ValueCountFrequency (%) 
Alta 857 58.3%
 
Media 368 25.0%
 
Muy_Alta 144 9.8%
 
Baja 83 5.6%
 
(Missing) 18 1.2%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length8
Mean length4.629931973
Min length3
Scatter

incremento_salario_porc
Real number (ℝ≥0)

Distinct count15
Unique (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.20952381
Minimum11
Maximum25
Zeros0
Zeros (%)0.0%
Memory size11.6 KiB
Mini histogram

Quantile statistics

Minimum11
5-th percentile11
Q112
median14
Q318
95-th percentile22
Maximum25
Range14
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.659937717
Coefficient of variation (CV)0.2406346025
Kurtosis-0.3005982221
Mean15.20952381
Median Absolute Deviation (MAD)3.055173307
Skewness0.8211279756
Sum22358
Variance13.39514409
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[11. 11.5 14.5 19.5 22.5 25. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
11 210 14.3%
 
13 209 14.2%
 
14 201 13.7%
 
12 198 13.5%
 
15 101 6.9%
 
18 89 6.1%
 
17 82 5.6%
 
16 78 5.3%
 
19 76 5.2%
 
22 56 3.8%
 
Other values (5) 170 11.6%
 
ValueCountFrequency (%) 
11 210 14.3%
 
12 198 13.5%
 
13 209 14.2%
 
14 201 13.7%
 
15 101 6.9%
 
ValueCountFrequency (%) 
25 18 1.2%
 
24 21 1.4%
 
23 28 1.9%
 
22 56 3.8%
 
21 48 3.3%
 

mayor_edad
Categorical

CONST
HIGH CORRELATION
Distinct count1
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
ValueCountFrequency (%) 
Y 1470 100.0%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length1
Mean length1
Min length1
Scatter

nivel_acciones
Categorical

Distinct count4
Unique (%)0.3%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
ValueCountFrequency (%) 
0 631 42.9%
 
1 596 40.5%
 
2 158 10.7%
 
3 85 5.8%
 

Composition

Contains charsFalse
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length1
Mean length1
Min length1
Scatter

nivel_laboral
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count5
Unique (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.063945578
Minimum1
Maximum5
Zeros0
Zeros (%)0.0%
Memory size11.6 KiB
Mini histogram

Quantile statistics

Minimum1
5-th percentile1
Q11
median2
Q33
95-th percentile4
Maximum5
Range4
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.106939899
Coefficient of variation (CV)0.5363222319
Kurtosis0.3991520554
Mean2.063945578
Median Absolute Deviation (MAD)0.8324753575
Skewness1.025401283
Sum3034
Variance1.22531594
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[1. 1.5 2.5 3.5 5. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1 543 36.9%
 
2 534 36.3%
 
3 218 14.8%
 
4 106 7.2%
 
5 69 4.7%
 
ValueCountFrequency (%) 
1 543 36.9%
 
2 534 36.3%
 
3 218 14.8%
 
4 106 7.2%
 
5 69 4.7%
 
ValueCountFrequency (%) 
5 69 4.7%
 
4 106 7.2%
 
3 218 14.8%
 
2 534 36.3%
 
1 543 36.9%
 

num_empresas_anteriores
Real number (ℝ≥0)

ZEROS
Distinct count10
Unique (%)0.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.693197279
Minimum0
Maximum9
Zeros197
Zeros (%)13.4%
Memory size11.6 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q11
median2
Q34
95-th percentile8
Maximum9
Range9
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.498009006
Coefficient of variation (CV)0.9275254455
Kurtosis0.01021381669
Mean2.693197279
Median Absolute Deviation (MAD)2.059758434
Skewness1.026471112
Sum3959
Variance6.240048994
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0. 0.5 1.5 4.5 8.5 9. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1 521 35.4%
 
0 197 13.4%
 
3 159 10.8%
 
2 146 9.9%
 
4 139 9.5%
 
7 74 5.0%
 
6 70 4.8%
 
5 63 4.3%
 
9 52 3.5%
 
8 49 3.3%
 
ValueCountFrequency (%) 
0 197 13.4%
 
1 521 35.4%
 
2 146 9.9%
 
3 159 10.8%
 
4 139 9.5%
 
ValueCountFrequency (%) 
9 52 3.5%
 
8 49 3.3%
 
7 74 5.0%
 
6 70 4.8%
 
5 63 4.3%
 

num_formaciones_ult_ano
Real number (ℝ≥0)

ZEROS
Distinct count7
Unique (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.799319728
Minimum0
Maximum6
Zeros54
Zeros (%)3.7%
Memory size11.6 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile1
Q12
median3
Q33
95-th percentile5
Maximum6
Range6
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.289270621
Coefficient of variation (CV)0.4605656896
Kurtosis0.494992986
Mean2.799319728
Median Absolute Deviation (MAD)0.9743440233
Skewness0.5531241711
Sum4115
Variance1.662218734
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0. 1.5 3.5 6. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2 547 37.2%
 
3 491 33.4%
 
4 123 8.4%
 
5 119 8.1%
 
1 71 4.8%
 
6 65 4.4%
 
0 54 3.7%
 
ValueCountFrequency (%) 
0 54 3.7%
 
1 71 4.8%
 
2 547 37.2%
 
3 491 33.4%
 
4 123 8.4%
 
ValueCountFrequency (%) 
6 65 4.4%
 
5 119 8.1%
 
4 123 8.4%
 
3 491 33.4%
 
2 547 37.2%
 

puesto
Categorical

HIGH CORRELATION
Distinct count9
Unique (%)0.6%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
ValueCountFrequency (%) 
Sales Executive 326 22.2%
 
Research Scientist 292 19.9%
 
Laboratory Technician 259 17.6%
 
Manufacturing Director 145 9.9%
 
Healthcare Representative 131 8.9%
 
Manager 102 6.9%
 
Sales Representative 83 5.6%
 
Research Director 80 5.4%
 
Human Resources 52 3.5%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceTrue
Contains non-wordsTrue

Length

Max length25
Mean length18.0707483
Min length7
Scatter

salario_mes
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count1349
Unique (%)91.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6502.931293
Minimum1009
Maximum19999
Zeros0
Zeros (%)0.0%
Memory size11.6 KiB
Mini histogram

Quantile statistics

Minimum1009
5-th percentile2097.9
Q12911
median4919
Q38379
95-th percentile17821.35
Maximum19999
Range18990
Interquartile range (IQR)5468

Descriptive statistics

Standard deviation4707.956783
Coefficient of variation (CV)0.7239745541
Kurtosis1.005232691
Mean6502.931293
Median Absolute Deviation (MAD)3631.446085
Skewness1.369816681
Sum9559309
Variance22164857.07
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[ 1009. 2004. 2981. 5463.5 5487.5 ... 13118. 13999.5 15982. 19035.5 19999. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2342 4 0.3%
 
6142 3 0.2%
 
2741 3 0.2%
 
2559 3 0.2%
 
2610 3 0.2%
 
2451 3 0.2%
 
5562 3 0.2%
 
3452 3 0.2%
 
2380 3 0.2%
 
6347 3 0.2%
 
Other values (1339) 1439 97.9%
 
ValueCountFrequency (%) 
1009 1 0.1%
 
1051 1 0.1%
 
1052 1 0.1%
 
1081 1 0.1%
 
1091 1 0.1%
 
ValueCountFrequency (%) 
19999 1 0.1%
 
19973 1 0.1%
 
19943 1 0.1%
 
19926 1 0.1%
 
19859 1 0.1%
 

satisfaccion_companeros
Categorical

Distinct count4
Unique (%)0.3%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
ValueCountFrequency (%) 
Alta 459 31.2%
 
Muy_Alta 432 29.4%
 
Media 303 20.6%
 
Baja 276 18.8%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length8
Mean length5.381632653
Min length4
Scatter

satisfaccion_entorno
Categorical

Distinct count4
Unique (%)0.3%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
ValueCountFrequency (%) 
Alta 453 30.8%
 
Muy_Alta 446 30.3%
 
Media 287 19.5%
 
Baja 284 19.3%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length8
Mean length5.408843537
Min length4
Scatter

satisfaccion_trabajo
Categorical

MISSING
HIGH CORRELATION
Distinct count5
Unique (%)0.3%
Missing76
Missing (%)5.2%
Memory size11.6 KiB
ValueCountFrequency (%) 
Alta 828 56.3%
 
Media 354 24.1%
 
Muy_Alta 136 9.3%
 
Baja 76 5.2%
 
(Missing) 76 5.2%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length8
Mean length4.559183673
Min length3
Scatter

sexo
Categorical

MISSING
HIGH CORRELATION
HIGH CORRELATION
Distinct count5
Unique (%)0.3%
Missing199
Missing (%)13.5%
Memory size11.6 KiB
ValueCountFrequency (%) 
3 739 50.3%
 
2 328 22.3%
 
4 130 8.8%
 
1 74 5.0%
 
(Missing) 199 13.5%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length3
Mean length3
Min length3
Scatter

viajes
Categorical

Distinct count3
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
ValueCountFrequency (%) 
Travel_Rarely 1043 71.0%
 
Travel_Frequently 277 18.8%
 
Non-Travel 150 10.2%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length17
Mean length13.44761905
Min length10
Scatter

Correlations

Missing values

Sample

First rows

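(index) | abandono | anos_compania | anos_con_manager_actual | anos_desde_ult_promocion | anos_en_puesto | anos_experiencia | carrera | conciliacion | departamento | distancia_casa | edad | educacion | empleados | estado_civil | evaluacion | horas_extra | horas_quincena | id | implicacion | incremento_salario_porc | mayor_edad | nivel_acciones | nivel_laboral | num_empresas_anteriores | num_formaciones_ult_ano | puesto | salario_mes | satisfaccion_companeros | satisfaccion_entorno | satisfaccion_trabajo | sexo | viajes
0 | Yes | 6 | 5 | 0 | NaN | 8 | Life Sciences | NaN | Sales | 1 | 41 | Universitaria | 1 | Single | Alta | Yes | 80 | 1 | Alta | 11 | Y | 0 | 2 | 8 | 0 | Sales Executive | 5993 | Baja | Media | Alta | 3.0 | Travel_Rarely
1 | No | 10 | 7 | 1 | NaN | 10 | Life Sciences | NaN | Research & Development | 8 | 49 | Secundaria | 1 | Married | Muy_Alta | No | 80 | 2 | Media | 23 | Y | 1 | 2 | 1 | 3 | Research Scientist | 5130 | Muy_Alta | Alta | Media | 2.0 | Travel_Frequently
2 | Yes | 0 | 0 | 0 | 2.0 | 7 | Other | NaN | Research & Development | 2 | 37 | Secundaria | 1 | Single | Alta | Yes | 80 | 4 | Media | 15 | Y | 0 | 1 | 6 | 3 | Laboratory Technician | 2090 | Media | Muy_Alta | Media | 2.0 | Travel_Rarely
3 | No | 8 | 0 | 3 | 3.0 | 8 | Life Sciences | NaN | Research & Development | 3 | 33 | Universitaria | 1 | Married | Alta | Yes | 80 | 5 | Alta | 11 | Y | 0 | 1 | 1 | 3 | Research Scientist | 2909 | Alta | Muy_Alta | Alta | 3.0 | Travel_Frequently
4 | No | 2 | 2 | 2 | NaN | 6 | Medical | NaN | Research & Development | 2 | 27 | Universitaria | 1 | Married | Alta | No | 80 | 7 | Alta | 12 | Y | 1 | 1 | 9 | 3 | Laboratory Technician | 3468 | Muy_Alta | Baja | Alta | 3.0 | Travel_Rarely
5 | No | 7 | 6 | 3 | NaN | 8 | Life Sciences | NaN | Research & Development | 2 | 32 | Universitaria | 1 | Single | Alta | No | 80 | 8 | Alta | 13 | Y | 0 | 1 | 0 | 2 | Laboratory Technician | 3068 | Alta | Muy_Alta | Alta | 3.0 | Travel_Frequently
6 | No | 1 | 0 | 0 | NaN | 12 | Medical | Muy_Alta | Research & Development | 3 | 59 | Master | 1 | Married | Muy_Alta | Yes | 80 | 10 | Muy_Alta | 20 | Y | 3 | 1 | 4 | 3 | Laboratory Technician | 2670 | Baja | Alta | Muy_Alta | 4.0 | Travel_Rarely
7 | No | 1 | 0 | 0 | NaN | 1 | Life Sciences | NaN | Research & Development | 24 | 30 | Universitaria | 1 | Divorced | Muy_Alta | No | 80 | 11 | Alta | 22 | Y | 1 | 1 | 1 | 2 | Laboratory Technician | 2693 | Media | Muy_Alta | Alta | 3.0 | Travel_Rarely
8 | No | 9 | 8 | 1 | NaN | 10 | Life Sciences | NaN | Research & Development | 23 | 38 | Secundaria | 1 | Single | Muy_Alta | No | 80 | 12 | Media | 21 | Y | 0 | 3 | 0 | 2 | Manufacturing Director | 9526 | Media | Muy_Alta | Media | 2.0 | Travel_Frequently
9 | No | 7 | 7 | 7 | NaN | 17 | Medical | Alta | Research & Development | 27 | 36 | Universitaria | 1 | Married | Alta | No | 80 | 13 | Alta | 13 | Y | 2 | 2 | 6 | 3 | Healthcare Representative | 5237 | Media | Alta | Alta | 3.0 | Travel_Rarely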

Last rows

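(index) | abandono | anos_compania | anos_con_manager_actual | anos_desde_ult_promocion | anos_en_puesto | anos_experiencia | carrera | conciliacion | departamento | distancia_casa | edad | educacion | empleados | estado_civil | evaluacion | horas_extra | horas_quincena | id | implicacion | incremento_salario_porc | mayor_edad | nivel_acciones | nivel_laboral | num_empresas_anteriores | num_formaciones_ult_ano | puesto | salario_mes | satisfaccion_companeros | satisfaccion_entorno | satisfaccion_trabajo | sexo | viajes
1460 | No | 5 | 4 | 0 | NaN | 5 | Medical | NaN | Research & Development | 28 | 29 | Secundaria | 1 | Single | Alta | No | 80 | 2054 | Media | 14 | Y | 0 | 1 | 1 | 3 | Research Scientist | 3785 | Media | Muy_Alta | Media | 2.0 | Travel_Rarely
1461 | Yes | 3 | 0 | 2 | NaN | 20 | Marketing | NaN | Sales | 28 | 50 | Secundaria | 1 | Divorced | Alta | Yes | 80 | 2055 | Media | 13 | Y | 1 | 3 | 4 | 3 | Sales Executive | 10854 | Media | Muy_Alta | Media | 2.0 | Travel_Rarely
1462 | No | 20 | 6 | 9 | NaN | 21 | Marketing | NaN | Sales | 24 | 39 | Secundaria | 1 | Married | Alta | No | 80 | 2056 | Media | 11 | Y | 1 | 4 | 0 | 2 | Sales Executive | 12031 | Baja | Media | NaN | 2.0 | Travel_Rarely
1463 | No | 9 | 7 | 1 | 3.0 | 10 | Medical | NaN | Research & Development | 5 | 31 | Universitaria | 1 | Single | Alta | No | 80 | 2057 | Alta | 19 | Y | 0 | 2 | 0 | 2 | Manufacturing Director | 9936 | Media | Media | Alta | 3.0 | Non-Travel
1464 | No | 4 | 0 | 0 | NaN | 5 | Other | Media | Sales | 5 | 26 | Secundaria | 1 | Single | Alta | No | 80 | 2060 | Media | 18 | Y | 0 | 1 | 0 | 2 | Sales Representative | 2966 | Muy_Alta | Muy_Alta | Media | 2.0 | Travel_Rarely
1465 | No | 5 | 3 | 0 | 4.0 | 17 | Medical | NaN | Research & Development | 23 | 36 | Master | 1 | Married | Alta | No | 80 | 2061 | Muy_Alta | 17 | Y | 1 | 2 | 4 | 3 | Laboratory Technician | 2571 | Alta | Alta | Muy_Alta | 4.0 | Travel_Frequently
1466 | No | 7 | 7 | 1 | NaN | 9 | Medical | NaN | Research & Development | 6 | 39 | Secundaria | 1 | Married | Alta | No | 80 | 2062 | Media | 15 | Y | 1 | 3 | 4 | 5 | Healthcare Representative | 9991 | Baja | Muy_Alta | Media | 2.0 | Travel_Rarely
1467 | No | 6 | 3 | 0 | NaN | 6 | Life Sciences | NaN | Research & Development | 4 | 27 | Master | 1 | Married | Muy_Alta | Yes | 80 | 2064 | Muy_Alta | 20 | Y | 1 | 2 | 1 | 0 | Manufacturing Director | 6142 | Media | Media | Muy_Alta | 4.0 | Travel_Rarely
1468 | No | 9 | 8 | 0 | NaN | 17 | Medical | NaN | Sales | 2 | 49 | Secundaria | 1 | Married | Alta | No | 80 | 2065 | Media | 14 | Y | 0 | 2 | 2 | 3 | Sales Executive | 5390 | Muy_Alta | Muy_Alta | Media | NaN | Travel_Frequently
1469 | No | 4 | 2 | 1 | NaN | 6 | Medical | Muy_Alta | Research & Development | 8 | 34 | NaN | 1 | Married | Alta | No | 80 | 2068 | Muy_Alta | 12 | Y | 0 | 2 | 2 | 3 | Laboratory Technician | 4404 | Baja | Media | Muy_Alta | 4.0 | Travel_Rarely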